Combination of temporal trajectory filtering and projection measure for robust speaker identification
نویسندگان
چکیده
This paper presents a method that combines the techniques of temporal trajectory filtering and projection measure for robust speaker identification. The proposed robust feature, called Relative Autocorrelation Sequence Mel-scale Frequency Cepstral Coefficients (RAS-MFCC), is derived based on filtering the temporal trajectories of short-time one-sided autocorrelation sequences. This filtering process can minimize the effect of additive noise in the noisy speech. Since the norm of RAS-MFCC shrinks due to noise corruption, the projection measure (PM) technique, which is effective in dealing with the norm shrinkage of cepstrum, can be applied for the distance measure of RAS-MFCCs. The combination of these two techniques is then applied to a task of speaker identification of 100 speakers. Our experiment shows that the use of RASMFCC feature achieves significant improvement in identification rate as comparing with the use of MFCC. The combination of RAS-MFCC feature with PM technique can further improve the recognition accuracy.
منابع مشابه
An Adaptive-Robust Control Approach for Trajectory Tracking of two 5 DOF Cooperating Robot Manipulators Moving a Rigid Payload
In this paper, a dual system consisting of two 5 DOF (RRRRR) robot manipulators is considered as a cooperative robotic system used to manipulate a rigid payload on a desired trajectory between two desired initial and end positions/orientations. The forward and inverse kinematic problems are first solved for the dual arm system. Then, dynamics of the system and the relations between forces/momen...
متن کاملRobust Sliding Mode Controller for Trajectory Tracking and Attitude Control of a Nonholonomic Spherical Mobile Robot
Based on dynamic modeling, robust trajectory tracking control of attitude and position of a spherical mobile robot is proposed. In this paper, the spherical robot is composed of a spherical shell and three independent rotors which act as the inner driver mechanism. Owing to rolling without slipping assumption, the robot is subjected to two nonholonomic constraints. The state space representatio...
متن کاملA Family of Selective Partial Update Affine Projection Adaptive Filtering Algorithms
In this paper we present a general formalism for the establishment of the family of selective partial update affine projection algorithms (SPU-APA). The SPU-APA, the SPU regularized APA (SPU-R-APA), the SPU partial rank algorithm (SPU-PRA), the SPU binormalized data reusing least mean squares (SPU-BNDR-LMS), and the SPU normalized LMS with orthogonal correction factors (SPU-NLMS-OCF) algorithms...
متن کاملSpectro-temporal features for robust far-field speaker identification
Features derived from an auditory spectro-temporal representation of speech are proposed for robust far-field speaker identification. The auditory representation is obtained by first filtering the speech signal with a gammatone filterbank. A modulation filterbank is then applied to the temporal envelope of each gammatone filter output. Compared to commonly used mel-frequency cepstral coefficien...
متن کاملRobust Speech Recognition Features Based on Temporal Trajectory Filtering and Non-Uniform Spectral Compression
This paper proposes a new feature extraction method based on temporal trajectory filtering and nonuniform spectral compression and examines its performance with two tasks in noisy environments. Temporal trajectory filtering is effective for robust speech recognition in noisy environments, due to human hearing is more sensitive to relative values rather than absolute values and the effect of add...
متن کامل